Automatic Discovery of Complex Causality By
نویسنده
چکیده
This study entails the understanding of and the development of a computational method for automatically extracting complex expressions in language that correspond to event to event sequential relations in the real world. We here develop component procedures of a system that would be capable of taking raw linguistic input (such as those from narrative writings or social network data), and find real-world semantic relations among events. Such an endeavor is applicable to many types of sequential relations, for which we use causality as a case study, both for its importance as a prominent type of sequential relation between events, as well as for its general prevalence in natural language. But we also demonstrate that the idea is also applicable in principle to other major types of event to event relations, such as reciprocity. The study primarily focuses on those types of causalities that contain complex structures and require in-depth linguistic analyses to discover and extract. Designing an automated method for the extraction of structurally complex causal expressions entails methodologies and theories that are beyond conventional methods used in computational semantics. The classes of adjunctive causal structure, and embedded causal structure are types that are hard to access using traditional methods, but more amenable for methods developed in this study. The principal procedures employed for the extraction of these are a heavily modified form of Hidden Markov Model (HMM), which we use to deal with causal structures that have sequentially complex makeup. We also designed a highly modified Genetic Algorithm (GA) adapted for embedded context-free structures, used to rank and extract those causal structures that have deep embedding at the syntax-semantics interface. These will be reformulated, augmented, and explored in depth. With these methods using unsupervised and semi-supervised learning, we were able to obtain reasonable results in terms of discrimination of causal pairs 〈ei, ej〉 pairs and some longer chains of causation from corpora. From these results, we were also able to perform additional linguistic analysis over their theoretical semantic structure, and observe aspects of each that allows us to sub-classify the relations according to standard ideas in formal logic as well as from behavioral psychology. These methods would be critical to a system
منابع مشابه
Automatic Discovery of Technology Networks for Industrial-Scale R&D IT Projects via Data Mining
Industrial-Scale R&D IT Projects depend on many sub-technologies which need to be understood and have their risks analysed before the project can begin for their success. When planning such an industrial-scale project, the list of technologies and the associations of these technologies with each other is often complex and form a network. Discovery of this network of technologies is time consumi...
متن کاملZ-Cognitive Map: An Integrated Cognitive Maps and Z-Numbers Approach under Cognitive Information
Usually, in real-world engineering problems, there are different types of uncertainties about the studied variables, which can be due to the specific variables under investigation or interaction between them. Fuzzy cognitive maps, which addresses the cause-effect relation between variables, is one of the most common models for better understanding of the problems, especially when the quantitati...
متن کاملCMDTS: The Causality-based Medical Diagnosis and Treatment System
Our medical world is replete with clinical data but this data is rarely automatically exploited for bringing more health to our society. Many researches have been conducted in Medical Data Mining, but almost all of them have focused on diagnosing the diseases not treating the patients. In this paper we propose the Causality-based Medical Diagnosis and Treatment System, which can be used to diag...
متن کاملCMDTS: The Causality-based Medical Diagnosis and Treatment System
Our medical world is replete with clinical data but this data is rarely automatically exploited for bringing more health to our society. Many researches have been conducted in Medical Data Mining, but almost all of them have focused on diagnosing the diseases not treating the patients. In this paper we propose the Causality-based Medical Diagnosis and Treatment System, which can be used to diag...
متن کاملAb Initio Study of Vinblastine-Tubulin Anticancer Complex
Vinblastine is an important anticancer agent known to diminish microtubule assembly. Ab initio calculations are applied to examine the structural properties and different energies of vinblastine-tubulin complex in different dielectric constants and temperatures. The aims of this work are discovery the best optimized structure and thermodynamic properties of vinblastine-tubulin complex ...
متن کاملA Knowledge-Intensive Approach for Semi-automatic Causal Subgroup Discovery
This paper presents a methodological view on knowledge-intensive causal subgroup discovery implemented in a semi-automatic approach. We show how to identify causal relations between subgroups by generating an extended causal subgroup network utilizing background knowledge. Using the links within the network we can identify causal relations, but also relations that are potentially confounded and...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015